CONSTRUCTION OF A DATABASE OF CO -OCCURRING eMOTIFS BASED ON CONDITIONAL PROBABILITIES

نویسنده

  • PRASANTH PULAVARTHI
چکیده

Classification of a newly discovered protein into a family of proteins enables the determination of its function. The eMOTIF system identifies conserved modular domains that confer functionality or structure to proteins and allows classification of proteins into families based on the conserved domains a protein contains. A program called multeeMOTIF has been developed which analyzes eMOTIFS and determines the conditional probabilities of their occurrences to find pairs of eMOTIFS that occur together a percentage of the time. The proteins that match one eMOTIF compose a large super-family of proteins and proteins that match each additional eMOTIF compose smaller and smaller sub-families. Based on how many and which eMOTIFS an unknown protein matches, it can be assigned to the appropriate sub-family. The more eMOTIFS matched, the more specific the family assignment will be. Those pairs of eMOTIFS that always occur together or occur together a high percentage of the time are observed to be from alignments of the same protein functionality 83% of the time. This may allow assignment of function to alignments of unknown function and consequently proteins belonging to such alignments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Simulation of a Random Knockout Tournament

We consider the problem of using simulation to efficiently estimate the win probabilities for participants in a general random knockout tournament. Both of our proposed estimators, one based on the notion of “observed survivals” and the other based on conditional expectation and post-stratification, are highly effective in terms of variance reduction when compared to the raw simulation estimato...

متن کامل

Visualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database

Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...

متن کامل

Developing an Integrated Simulation Model of Bayesian-networks to Estimate the Completion Cost of a Project under Risk: Case Study on Phase 13 of South Pars Gas Field Development Projects

Objective: The aim of this paper is to propose a new approach to assess the aggregated impact of risks on the completion cost of a construction project. Such an aggregated impact includes the main impacts of risks as well as the impacts of interactions caused by dependencies among them. Methods: In this study, Monte Carlo simulation and Bayesian Networks methods are combined to present a frame...

متن کامل

L2 Discourse Co-construction within the Learner’s ZPD

This study examined the effect of the ZPD-based discourse scaffolding on EFL learners' co-construction of L2 metadiscourse performing collaborative writing tasks and explored the discourse scaffolding dynamics. The participants were 160 EFL students that were assigned to four different treatment conditions: (i) formal teaching, (ii) input enhancement, (iii) non-ZPD interaction, and (iv) ZPD-bas...

متن کامل

Visualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database

Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000